Method and device for recognizing content

ABSTRACT

Provided is a method, performed by a device, of recognizing content, the method including selecting a template to be used for recognizing content displayed on the device when a type of a content service that provides the content to the device is determined; capturing a screen of the device when a control signal is received by the device; determining whether the captured screen corresponds to a gateway page; determining whether the captured screen corresponds to the selected template when the captured screen corresponds to the gateway page; and generating a template corresponding to the content service, based on the captured screen, when the captured screen does not correspond to the selected template.

This application is the U.S. national phase of International Application No. PCT/KR2017/002752 filed 14 Mar. 2017, which designated the U.S. and claims priority to KR Patent Application No. 10-2016-0031690 filed 16 Mar. 2016, the entire contents of each of which are hereby incorporated by reference.

TECHNICAL FIELD

The present disclosure relates to a method, performed by a device, of identifying content, a method, performed by a server, of identifying content, a device for identifying content, and a server for identifying content.

BACKGROUND ART

To provide an advertisement customized to meet the needs of various consumers, an advertisement provider needs to know what content a user is consuming.

A fingerprint-based content recognition technology according to the related art extracts a video or audio fingerprint from content currently being reproduced by a display device such as a TV, transmits the extracted fingerprint to a server, and matches the transmitted fingerprint with reference data of a database of the server to recognize content. Based on these results, it is possible to analyze the content consumption or the viewing pattern of the display device, and an advertiser may effectively provide a customized advertisement based on a result of the analysis.

However, the fingerprint-based content recognition technology imposes an additional computation or data transfer load on the display device during the process of extracting the fingerprint from the displayed content and transmitting the fingerprint to the server, as well as a large economic burden due to additional hardware, such as the need to operate a separate database server and a matching server. Accordingly, a need has arisen for a technology capable of quickly and easily detecting information about content being reproduced by the display device.

DESCRIPTION OF EMBODIMENTS

Technical Problem

Provided are a method of controlling a content-providing device that more efficiently acquires information about content being viewed by a user of the device through a captured screenshot of the device, as well as the device and a server.

Solution to Problem

According to an aspect of the present disclosure, a method of recognizing content by a device includes selecting a template to be used for recognizing content displayed on the device when the type of a content service that provides the content to the device is determined; capturing a screen of the device when a control signal is received by the device; determining whether the captured screen corresponds to a gateway page; determining whether the captured screen corresponds to the selected template when the captured screen corresponds to the gateway page; and generating a template corresponding to the content service on the basis of the captured screen when the captured screen does not correspond to the selected template.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a conceptual diagram for explaining a device for recognizing content, according to an embodiment.

FIG. 2 is a flowchart for explaining a method, performed by a device, of recognizing content, according to an embodiment.

FIG. 3 is a flowchart for explaining a method, performed by a device, of detecting a gateway page, according to an embodiment.

FIG. 4 is a diagram for explaining a method, performed by a device, of detecting a gateway page, according to an embodiment.

FIG. 5 is a diagram for explaining a first template generated by a device with respect to a content service, according to an embodiment.

FIG. 6 is a diagram for explaining a second template generated by a device with respect to a content service, according to another embodiment.

FIG. 7 is a flowchart for more specifically explaining a method, performed by a device, of recognizing content according to an embodiment.

FIG. 8 is a flowchart for explaining a method, performed by a device, of detecting text corresponding to content recognition information using a meaning recognition model from a captured screen, according to an embodiment.

FIG. 9 is a diagram for explaining a method, performed by a device, of updating a generated template, according to an embodiment.

FIG. 10 is a block diagram of a device according to an embodiment.

FIG. 11 is a block diagram of a device according to another embodiment.

BEST MODE

According to an aspect of the present disclosure, a method of recognizing content by a device includes selecting a template to be used for recognizing content displayed on the device when the type of a content service that provides the content to the device is determined; capturing a screen of the device when a control signal is received by the device; determining whether the captured screen corresponds to a gateway page; determining whether the captured screen corresponds to the selected template when the captured screen corresponds to the gateway page; and generating a template corresponding to the content service on the basis of the captured screen when the captured screen does not correspond to the selected template.

The method may further include capturing a screen of the device in a predetermined time unit from a time when the control signal is received.

The method may further include comparing a plurality of screens and selecting a screen satisfying a predetermined condition as a screen corresponding to the gateway page from among the plurality of screens.

The generating of the template may include recognizing a layout of components included in the captured screen, wherein the template displays default components corresponding to the components based on the recognized layout.

The generating of the template may include recognizing a text region on the captured screen, wherein the template includes information about a position and an arrangement of the recognized text region.

The method may further include: determining the type of the content service providing the content, wherein the selecting of the template includes: when a template corresponding to the type of the content service is not stored in the device, selecting a predetermined basic template as the template corresponding to the type of the content service.

The method may further include: detecting a text region from the captured screen by using the generated template; and recognizing content displayed on a screen of the device, based on text detected from the detected text region.

The method may further include: detecting text determined as content recognition information from the detected text by comparing the detected text and at least one text included in a predetermined meaning recognition model.

The method may further include: updating the template based on a position of the text determined as the content recognition information and displayed on the captured screen.

According to another aspect of the present disclosure, a device includes: a display configured to display content provided from a content service; an input unit configured to receive a control signal for controlling the content provided from the content service; and a controller configured to select a template used for recognizing the content displayed on the device when a type of the content service that provides the content to the device is determined, determine whether a captured screen corresponds to a gateway page including content recognition information when the control signal is received by the device, determine whether the captured screen corresponds to the selected template when the captured screen corresponds to the gateway page, and generate a template corresponding to the content service based on the captured screen when the captured screen does not correspond to the selected template.

The controller may be configured to capture a screen of the device in a predetermined time unit from a time when the control signal is received.

The controller may be configured to compare a plurality of screens and select a screen satisfying a predetermined condition as a screen corresponding to the gateway page from among the plurality of screens.

The controller may be further configured to recognize a layout of components included in the captured screen, wherein the template displays default components corresponding to the components based on the recognized layout.

The controller may be further configured to recognize a text region on the captured screen, wherein the template includes information about a position and an arrangement of the recognized text region.

The controller may be further configured to determine the type of the content service providing the content, and, when a template corresponding to the type of the content service is not stored in the device, select a predetermined basic template as the template corresponding to the type of the content service.

The controller may be further configured to detect a text region from the captured screen by using the generated template and recognize content displayed on a screen of the device, based on text detected from the detected text region.

The controller may be further configured to detect text determined as content recognition information from the detected text by comparing the detected text and at least one text included in a predetermined meaning recognition model.

The controller may be further configured to update the template based on a position of the text determined as the content recognition information and displayed on the captured screen.

Mode of Disclosure

Terms used in this specification will now be briefly described before describing embodiments in detail.

Although the terms used in the disclosure are general terms that are currently in wide use, selected in consideration of the functions referred to in the disclosure, they may vary depending on the intent of those of skill in the art, precedents, or the emergence of new technology. Accordingly, the terms used in the disclosure should be defined based on the meaning of the term and the contents throughout the disclosure, not simply on the name of the term.

It will be understood that the terms “comprises”, “comprising”, “includes” and/or “including”, when used herein, specify the presence of stated elements, but do not preclude the presence or addition of one or more other elements. The term “unit” or “module” is used to denote an entity for performing at least one function or operation, and may be embodied as a software element or a hardware element or a combination of hardware and software.

Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like elements throughout. In this regard, the present embodiments may have different forms and should not be construed as being limited to the descriptions set forth herein. In the following description, for clarity, parts or elements that are not related to the embodiments are omitted.

FIG. 1 is a conceptual diagram for explaining a device 100 for recognizing content, according to an embodiment.

As shown in FIG. 1, the device 100 may be a TV, but this is merely an example, and the device 100 may be implemented as any electronic device that includes a display. For example, the device 100 may be implemented as a variety of electronic devices such as a mobile phone, a tablet PC, a digital camera, a camcorder, a laptop computer, a desktop, an electronic book terminal, a digital broadcasting terminal, a personal digital assistant (PDA), a portable multimedia player (PMP), a navigation device, an MP3 player, a wearable device, and the like. In particular, the embodiments may be easily implemented in a display device having a large display such as a TV, but are not limited thereto. Also, the device 100 may be a fixed or mobile type, and may be a digital broadcast receiver capable of receiving digital broadcasting.

The device 100 according to an embodiment may be provided with content through a content service that provides at least one piece of content. For example, the device 100 may receive content from a content service providing device 10 that provides the content service. The content service providing device 10 according to an embodiment may be an external server or a device such as a set-top box. Also, the content service may include a real-time broadcast content service provided by a terrestrial broadcast, a cable broadcast set-top box, an IPTV set-top box, or the like, as well as a content service in the form of a web application such as Netflix and YouTube. However, the content service is not limited thereto.

Meanwhile, the device 100 according to an embodiment may identify a type of the content service when the content service is executed by the device 100. For example, the device 100 may identify the type of the content service by recognizing a set-top box that provides the content service. According to another example, the device 100 may identify the type of content service by recognizing a web application that is executed for the content service. According to another example, the device 100 may identify the type of the content service through an EPG (Electronic Program Guide) signal received together with the content.

The device 100 according to an embodiment may select a template corresponding to the identified type of the content service when the type of the content service is identified. Here, the template is used for recognizing the content, and may be a page having a layout corresponding to a gateway page including content recognition information such as a title and a genre of the content. Also, the gateway page may be, for example, a page displayed to provide information about the content to a user of the device 100 before the content is reproduced. Meanwhile, in this specification, a page may indicate a frame displayed on a screen of the device 100.

The device 100 according to an embodiment may capture a screen of the device 100 at a predetermined period when a control signal for controlling the content is received. The device 100 may compare a plurality of captured screens and detect a screen corresponding to a gateway page from among the plurality of screens. Also, the device 100 may compare the screen detected as the gateway page and the selected template to determine whether the detected screen corresponds to the selected template.

When the detected screen does not correspond to the selected template, the device 100 may generate the template corresponding to the detected screen. This will be described later in detail with reference to FIG. 2. The device 100 may detect the content recognition information from the detected screen using the generated template. Further, according to another example, when the detected screen corresponds to the selected template, the device 100 may detect the content recognition information from the detected screen based on the template. The device 100 may recognize the content displayed on the screen of the device 100 based on the detected content recognition information.

Meanwhile, the device 100 may transmit information about the recognized content to an external server 200 that analyzes a user viewing pattern. The server 200 may analyze at least one user viewing pattern based on the information about the content received from at least one user device (e.g., 100). Here, the viewing pattern may include information about a channel and a kind of content preferred by the user and the like.

FIG. 2 is a flowchart for explaining a method, performed by the device 100, of recognizing content, according to an embodiment.

In operation S210, the device 100 selects a template used for recognizing content displayed on the device 100 when a type of the content service providing the content to the device 100 is determined.

The device 100 according to an embodiment may determine the type of content service that provides the content to the device 100. For example, the device 100 may recognize a logo of a set-top box displayed on a screen of the device 100 when the set-top box connected to the device 100 is turned on, to execute the content service. The device 100 may determine the type of the content service through the recognized logo. According to another example, when a web application is executed for execution of the content service in the device 100, the device 100 may recognize a type of the web application to be executed, thereby determining the type of the content service provided to the device 100. According to another example, in the case of a terrestrial broadcast service, the device 100 may determine the type of the content service through an EPG signal received together with the content.

The device 100 according to an embodiment may select the template corresponding to the determined type of content service. The device 100 may store a template corresponding to each of at least one content service. For example, the device 100 may store a template corresponding to a first content service provided through an A web application and a template corresponding to a second content service provided through a B set-top box.

Meanwhile, according to another example, when the template corresponding to the determined type of content service is not stored in the device 100, the device 100 may select a predetermined basic template as the template corresponding to the determined type of the content service. Here, the basic template may be set to one frame that does not include information. However, this is an embodiment only, and the device 100 may instead generate a flag indicating that the template is not stored when the template corresponding to the determined type of the content service is not stored.
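
The selection logic of operation S210, including the fallback to a basic template, can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the `Template` class, the `select_template` function, the dictionary used as a template store, and the region tuple format are all hypothetical names introduced for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Hypothetical region format: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


@dataclass
class Template:
    service_type: str
    regions: List[Box] = field(default_factory=list)
    is_basic: bool = False  # True for the empty "basic template" fallback


def select_template(service_type: str, store: Dict[str, Template]) -> Template:
    """Return the stored template for the service, or an empty basic template."""
    stored = store.get(service_type)
    if stored is not None:
        return stored
    # No template stored for this service: fall back to one empty frame.
    return Template(service_type=service_type, regions=[], is_basic=True)


# Assumed example store holding one template for an "A web application" service.
store = {"A web application": Template("A web application", [(40, 600, 800, 60)])}

known = select_template("A web application", store)
unknown = select_template("B set-top box", store)
print(known.is_basic, unknown.is_basic)
```

In this sketch the `is_basic` attribute plays the role of the flag variant as well: a caller can treat a basic template as a guaranteed mismatch and proceed directly to template generation.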

In operation S220, when a control signal is received, the device 100 determines whether a captured screen corresponds to a gateway page including content recognition information. Here, the content recognition information may include text that may identify a title, a genre, etc. of the content.

The device 100 according to an embodiment may receive the control signal for controlling at least one piece of content provided by the device 100. For example, the device 100 may receive the control signal for controlling the at least one piece of content provided by the device 100 from an input device. However, this is only an embodiment, and according to another example, the device 100 may receive a control signal in the form of touch or gesture from a user. Here, the control signal may include one of a channel change signal of the device 100, a power on signal of the device 100, a connection signal between another device and the device 100, a menu selection signal of the device 100, and a request signal for at least one piece of content information.

The device 100 according to an embodiment may capture a screen of the device 100 in a predetermined time unit when the control signal is received. A plurality of screens captured by the device 100 in the predetermined time unit may include at least one of a screen in which a content control menu for content control is displayed, a screen in which a gateway page for introducing the content is displayed before the reproduction of specific content is started, and a screen on which the content is displayed.

The device 100 according to an embodiment may select a screen that satisfies a predetermined condition from among the plurality of screens as a screen corresponding to the gateway page. For example, the device 100 may compare the plurality of screens and select the screen captured immediately before a screen from which a component including the content recognition information has been removed.

In operation S230, the device 100 determines, based on a result of the determination, whether the captured screen corresponds to the selected template.

The device 100 according to an embodiment may compare the captured screen with a layout of the selected template to determine whether the captured screen corresponds to the selected template. Here, the layout may represent a format in which components of at least one image unit included in the screen are arranged. For example, when the captured screen has a layout including a first image in the upper left and a second image in the lower right, the device 100 may determine whether the selected template also has the same layout as the captured screen.
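
One way to picture the layout comparison of operation S230 is to treat each layout as a list of component bounding boxes and to check that every box on the captured screen has a counterpart in the template. The sketch below assumes this representation; the normalized box format, the tolerance value, and the function names `boxes_match` and `layout_matches` are hypothetical and are not taken from the disclosure.

```python
from typing import List, Tuple

# Hypothetical box format: (x, y, width, height) as fractions of the screen size.
Box = Tuple[float, float, float, float]


def boxes_match(a: Box, b: Box, tol: float = 0.05) -> bool:
    """Two boxes match when every coordinate differs by at most `tol`."""
    return all(abs(p - q) <= tol for p, q in zip(a, b))


def layout_matches(screen: List[Box], template: List[Box], tol: float = 0.05) -> bool:
    """Return True when each screen box pairs with a distinct template box."""
    if len(screen) != len(template):
        return False
    remaining = list(template)
    for box in screen:
        match = next((t for t in remaining if boxes_match(box, t, tol)), None)
        if match is None:
            return False
        remaining.remove(match)  # each template box may be used only once
    return True


# A first image in the upper left and a second image in the lower right.
captured = [(0.05, 0.05, 0.30, 0.20), (0.60, 0.70, 0.35, 0.25)]
stored = [(0.06, 0.05, 0.30, 0.21), (0.60, 0.71, 0.35, 0.25)]
basic = []  # an empty basic template never matches a non-empty screen

print(layout_matches(captured, stored), layout_matches(captured, basic))
```

The greedy pairing used here is adequate for well-separated components; a layout with many near-identical boxes might call for a stricter assignment step.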

In operation S240, when the captured screen does not correspond to the selected template, the device 100 generates a template corresponding to the content service based on the captured screen.

When the template corresponding to the determined type of content service is not stored in the device 100 according to an embodiment and a basic template is set or a flag is generated, the device 100 may determine that the captured screen does not match the selected template. Accordingly, the device 100 may generate the template based on the captured screen.

The device 100 according to an embodiment may generate the template having the layout corresponding to the captured screen. For example, the device 100 may generate the template in which a layout of default components is set in correspondence to a layout of components on the screen corresponding to the gateway page. Here, the default components may have a size and shape corresponding to the components included in the captured screen.
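
The template generation of operation S240 can be illustrated by copying the geometry of each component on the gateway page into an empty placeholder. The following is a sketch only; the `Component` and `DefaultComponent` classes and the `generate_template` function are hypothetical, and how components are actually found on the screen is outside its scope.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Component:
    """A component found on the captured gateway page (assumed model)."""
    kind: str  # e.g. "image" or "text"
    x: int
    y: int
    width: int
    height: int
    content: str = ""  # what is actually shown; not copied into the template


@dataclass(frozen=True)
class DefaultComponent:
    """An empty placeholder with the same kind, position, and size."""
    kind: str
    x: int
    y: int
    width: int
    height: int


def generate_template(components: List[Component]) -> List[DefaultComponent]:
    """Build a template whose default components mirror the screen layout."""
    return [DefaultComponent(c.kind, c.x, c.y, c.width, c.height) for c in components]


gateway_page = [
    Component("image", 0, 0, 640, 360, content="poster"),
    Component("text", 0, 380, 640, 40, content="Example Title"),
]
template = generate_template(gateway_page)
print(template[1])
```

Because only geometry is kept, the same template can later be applied to gateway pages of other content from the same service.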

Meanwhile, the device 100 according to an embodiment may detect the content recognition information from the captured screen by comparing the generated template with the captured screen. For example, the device 100 may recognize a text region in which text is displayed from the captured screen and read text by applying OCR technology to the recognized text region, thereby recognizing the content displayed on the screen of the device 100.

FIG. 3 is a flowchart for explaining a method, performed by the device 100, of detecting a gateway page, according to an embodiment.

In operation S310, the device 100 selects a template used for recognizing content displayed on the device 100 when a type of the content service providing the content to the device 100 is determined.

Meanwhile, operation S310 may correspond to operation S210 described above with reference to FIG. 2.

In operation S320, the device 100 may capture a screen of the device 100 in a predetermined time unit when a control signal is received.

The device 100 may capture the screen of the device 100 on which the content is displayed at a predetermined period, to detect the gateway page. For example, when a user of the device 100 transmits the control signal to the device 100 to select any one of a plurality of pieces of content provided to the device 100, the device 100 may capture the screen of the device 100 on which the content is displayed at the predetermined period from a time when the control signal is received.

In operation S330, when a component displayed on a first screen is removed from a second screen as a result of comparing a captured first screen and a second screen, the device 100 may detect the first screen as the gateway page. Here, the component is at least part of information constituting the gateway page displayed before the content is reproduced, to provide information about the content. For example, the component may include an image box displaying text relating to a plot of the content, an image box displaying text relating to a title of the content, and an image box displaying text relating to a viewing rating of the content.

The device 100 according to an embodiment may compare a plurality of captured screens to select the screen captured immediately before a screen on which the component is not displayed.

For example, the device 100 may compare the first screen captured at the time when the control signal is received and the second screen captured after a predetermined first time from the time when the control signal is received. The device 100 may detect the first screen as the gateway page when the component displayed on the first screen is not displayed on the second screen as a result of comparing the first screen and the second screen.
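
The comparison in operation S330 can be sketched by modeling each captured screen as the set of components visible on it and looking for the first screen whose information component is missing from the next capture. This is an illustrative sketch; the set-of-names screen model, the component names, and the function `detect_gateway_index` are assumptions made for the example.

```python
from typing import List, Optional, Set


def detect_gateway_index(screens: List[Set[str]], info_components: Set[str]) -> Optional[int]:
    """Return the index of the first screen showing an information component
    that is no longer shown on the next captured screen, or None."""
    for i in range(len(screens) - 1):
        shown = screens[i] & info_components   # info components on this screen
        removed = shown - screens[i + 1]       # those gone from the next screen
        if removed:
            return i
    return None


# Hypothetical component names for title, plot, and viewing-rating boxes.
INFO = {"title_box", "plot_box", "rating_box"}

captures = [
    {"title_box", "plot_box", "rating_box", "video"},  # first screen
    {"video"},                                          # second screen
]
print(detect_gateway_index(captures, INFO))
print(detect_gateway_index([{"video"}, {"video"}], INFO))
```

With real frames, the set membership test would be replaced by whatever component detection the device uses; the control flow stays the same.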

In operation S340, the device 100 may determine whether the first screen corresponds to the template.

The device 100 according to an embodiment may compare the first screen and a layout of the selected template to determine whether the first screen and the selected template correspond to each other.

In operation S350, when the first screen does not correspond to the selected template, the device 100 may generate the template based on the first screen.

Meanwhile, operation S350 may correspond to operation S240 described above with reference to FIG. 2.

FIG. 4 is a diagram for explaining a method, performed by the device 100, of detecting a gateway page, according to an embodiment.

The device 100 according to an embodiment may receive a control signal at time t1 to select any one of a plurality of pieces of content provided to the device 100. The device 100 may capture a screen of the device 100 on which content is displayed in a predetermined time unit from the time t1 when the control signal is received. For example, the device 100 may capture the screen of the device 100 on which the content is displayed at times t1, t2, and t3, respectively. In FIG. 4, screens captured at the times t1, t2, and t3 will be described as a first screen 410, a second screen 420, and a third screen 430, respectively.

The device 100 according to an embodiment may compare the second screen 420 and the third screen 430 to detect the gateway page of the content. For example, when a component 425 displayed on the second screen 420 is removed from the third screen 430 as a result of comparing the first screen 410, the second screen 420, and the third screen 430, the device 100 may detect the second screen 420 as the gateway page.

Also, according to another example, as a result of comparing the first screen 410, the second screen 420, and the third screen 430, the device 100 may detect the second screen 420 captured before the third screen 430 from which a content control menu 415 displayed on the first screen 410 disappears and on which the content is displayed, as the gateway page. Here, the content control menu 415 may include a user interface for selecting any one of at least one piece of content provided from a content service.
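
The second example of FIG. 4 can be walked through with three captures standing in for the screens 410, 420, and 430. The sketch below is a hedged illustration: the screen model (a set of element names per capture), the element names, and the function `gateway_before_playback` are hypothetical and serve only to show the rule of picking the capture that precedes the first playback-only screen.

```python
from typing import List, Optional, Set


def gateway_before_playback(
    screens: List[Set[str]],
    menu: str = "content_control_menu",
    info: str = "info_component",
    content: str = "content",
) -> Optional[int]:
    """Return the index of the screen captured just before the first screen
    on which only the content is shown (no menu, no information component)."""
    for i in range(1, len(screens)):
        current = screens[i]
        playback_only = menu not in current and info not in current and content in current
        if playback_only and info in screens[i - 1]:
            return i - 1
    return None


captured = [
    {"content_control_menu", "content"},  # first screen 410, captured at t1
    {"info_component", "content"},         # second screen 420, captured at t2
    {"content"},                           # third screen 430, captured at t3
]
print(gateway_before_playback(captured))
```

Here the index 1 corresponds to the second screen 420, which matches the screen detected as the gateway page in the description above.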

FIG. 5 is a diagram for explaining a first template generated by the device 100 with respect to a content service, according to an embodiment.

The device 100 according to an embodiment may determine a type of the content service that provides content to the device 100. For example, the device 100 may determine the type of the content service by detecting a logo of a set-top box displayed on a screen of the device 100 when the set-top box is powered on. According to another example, the device 100 may determine the type of the content service by determining a type of a web application to be executed in the device 100. According to another example, when a terrestrial broadcast service is received, the device 100 may detect an EPG (Electronic Program Guide) received together with the content and determine the type of the content service through the detected EPG.

Referring to FIG. 5, when the EPG is received by the device 100, the device 100 may determine that the content service providing the content to the device 100 is a terrestrial TV broadcasting service. Accordingly, the device 100 may select a template corresponding to the terrestrial TV broadcasting service.

Meanwhile, the device 100 according to an embodiment may select a predetermined basic template when the template corresponding to the terrestrial TV broadcasting service is not stored in the device 100. However, this is an embodiment only, and the device 100 may set a flag indicating that the template corresponding to the terrestrial TV broadcasting service is not stored in the device 100.

Also, when the control signal for selecting the content displayed on the device 100 is received, the device 100 may capture a screen of the device 100 in a predetermined time unit from a time when the control signal is received. The device 100 according to an embodiment may compare a plurality of captured screens to detect a gateway page among the plurality of captured screens. Here, a method of detecting the gateway page may correspond to the method described above with reference to FIG. 4.

The device 100 according to an embodiment may determine whether a detected screen 510 corresponds to the selected template. Meanwhile, the selected template is a basic template selected since the template corresponding to the terrestrial TV broadcasting service is not stored in the device 100, and thus the detected screen 510 may not correspond to the selected template in this embodiment. Further, according to another example, when the device 100 sets the flag indicating that the template corresponding to the terrestrial TV broadcasting service is not stored in the device 100, the device 100 may determine, via the set flag, that the detected screen 510 does not correspond to the selected template.

Accordingly, the device 100 according to an embodiment may generate a first template 520 corresponding to the terrestrial TV broadcasting service based on the detected screen 510. For example, the device 100 may analyze a layout of the detected screen 510 to generate the first template 520 having a layout corresponding to the layout of the detected screen 510.

Meanwhile, the device 100 according to an embodiment may detect text corresponding to content recognition information from the detected screen 510 using the generated first template 520. The device 100 may recognize the content displayed on the screen of the device 100 based on the detected text.

FIG. 6 is a diagram for explaining a second template 620 generated by the device 100 with respect to a content service according to another embodiment.

Referring to FIG. 6, the device 100 may determine that the content service providing content to the device 100 is an A Web application service by identifying a web application that is running in the device 100. Accordingly, the device 100 may select a template corresponding to the A Web application service.

Meanwhile, the device 100 according to an embodiment may select a predetermined basic template when the template corresponding to the A Web application service is not stored in the device 100. However, this is an embodiment only, and the device 100 may set a flag indicating that the template corresponding to the A Web application service is not stored in the device 100.

Also, when the control signal for selecting the content displayed on the device 100 is received, the device 100 may capture a screen of the device 100 in a predetermined time unit from a time when the control signal is received. The device 100 according to an embodiment may compare a plurality of captured screens to detect a gateway page from among the plurality of captured screens. Here, a method of detecting the gateway page may correspond to the method described above with reference to FIG. 4.

The device 100 according to an embodiment may determine whether a detected screen 610 corresponds to the selected template. Meanwhile, since the selected template is a basic template selected when a template corresponding to the A Web application service is not stored in the device 100, the detected screen 610 may not correspond to the selected template in this embodiment. Further, according to another example, when the device 100 sets a flag indicating that the template corresponding to the A Web application service is not stored in the device 100, the device 100 may determine, via the set flag, that the detected screen 610 does not correspond to the selected template.

Accordingly, the device 100 according to an embodiment may generate the second template 620 corresponding to the A Web application service based on the detected screen 610. For example, the device 100 may analyze a layout of the detected screen 610 and generate the second template 620 having a layout corresponding to the layout of the detected screen 610.

Meanwhile, the device 100 according to an embodiment may detect text corresponding to content recognition information from the detected screen 610 using the generated second template 620. The device 100 may recognize the content displayed on the screen of the device 100 based on the detected text.

FIG. 7 is a flowchart for more specifically explaining a method performed by the device 100 of recognizing content according to an embodiment.

In operation S710, the device 100 may determine a type of a content service that provides the content to the device 100. For example, the device 100 may recognize a logo A of a set-top box displayed on a screen of the device 100 when the set-top box connected to the device 100 is turned on. The device 100 may determine, via the recognized logo A, that the content service providing the content to the device 100 is an A content service provided from the A set-top box.

According to another example, when a B web application installed in the device 100 is executed, the device 100 may detect the execution through metadata of the B web application. Accordingly, the device 100 may determine that the content service providing the content to the device 100 is a B content service provided via the B web application.

According to another example, when the device 100 detects an EPG signal provided to the device 100 from a base station transmitting a terrestrial broadcast, the device 100 may determine that the content service providing the content to the device 100 is a C content service that is one of a plurality of terrestrial broadcast services.

In operation S720, the device 100 may select a template when the type of the content service is determined.

The device 100 according to an embodiment may store a template corresponding to the type of the content service. When the type of the content service is determined, the device 100 may select the template corresponding to the determined type of the content service from the stored templates.

Meanwhile, when the device 100 does not store the template corresponding to the type of the content service, the device 100 may select a predetermined basic template or generate a flag indicating that the template is not stored.
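The selection in operation S720 may be sketched as follows. This is a minimal illustration only, not the disclosed implementation: the `Template` class, the `BASIC_TEMPLATE` constant, and the `select_template` function are hypothetical names, and returning the basic template together with a flag combines, for convenience, the two alternatives described above.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# (x, y, width, height) of a region on the screen; an assumed representation.
Box = Tuple[int, int, int, int]


@dataclass(frozen=True)
class Template:
    """Hypothetical template: a service type plus the regions it expects."""
    service_type: str
    regions: Tuple[Box, ...] = ()


# Assumed stand-in for the "predetermined basic template".
BASIC_TEMPLATE = Template(service_type="basic")


def select_template(service_type: str,
                    stored: Dict[str, Template]) -> Tuple[Template, bool]:
    """Return (template, not_stored_flag) for the determined service type."""
    template = stored.get(service_type)
    if template is not None:
        return template, False
    # No template is stored for this service: fall back and raise the flag.
    return BASIC_TEMPLATE, True
```

When the flag is set, a later step may skip the layout comparison and proceed directly to generating a template from the captured screen.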

In operation S730, the device 100 may receive a control signal for controlling the content.

In operation S740, the device 100 may capture a screen of the device 100 in a predetermined time unit when the control signal is received.

In operation S750, the device 100 may determine whether the captured screen corresponds to a gateway page.

When the control signal is received, the device 100 according to an embodiment may compare a plurality of screens captured in the predetermined time unit and select a screen corresponding to the gateway page. Meanwhile, a method performed by the device 100 of determining whether the captured screen corresponds to the gateway page may correspond to the method described above with reference to FIG. 3.

Meanwhile, when the screen corresponding to the gateway page is not present among the plurality of captured screens, the device 100 according to an embodiment may capture the screen of the device 100 in a predetermined time unit from a time when another control signal is received.

In operation S760, the device 100 may determine whether the captured screen corresponds to the template.

The device 100 according to an embodiment may compare a layout of the template with a layout of the captured screen to determine whether the captured screen corresponds to the template.
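The layout comparison of operation S760 is not specified in detail in this description. One possible sketch, under the assumption that a layout is a list of component bounding boxes and that two layouts correspond when every template box overlaps some screen box with a sufficient intersection-over-union (IoU), is shown below. The function names and the threshold value of 0.8 are illustrative assumptions.

```python
from typing import Sequence, Tuple

# (x, y, width, height); an assumed representation of a component's position.
Box = Tuple[int, int, int, int]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    overlap_w = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    overlap_h = max(0, min(ay + ah, by + bh) - max(ay, by))
    intersection = overlap_w * overlap_h
    union = aw * ah + bw * bh - intersection
    return intersection / union if union else 0.0


def matches_template(template_boxes: Sequence[Box],
                     screen_boxes: Sequence[Box],
                     threshold: float = 0.8) -> bool:
    """Assumed criterion: same number of boxes, each template box matched."""
    if len(template_boxes) != len(screen_boxes):
        return False
    return all(
        any(iou(t, s) >= threshold for s in screen_boxes)
        for t in template_boxes
    )
```

A stricter or looser criterion (for example, one that also compares component types) could be substituted without changing the surrounding flow.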

In operation S770, the device 100 may generate a template based on the captured screen.

The device 100 according to an embodiment may generate the template based on the captured screen when the captured screen does not correspond to the template.

In operation S780, the device 100 may detect a text region from the captured screen using the template.

The device 100 according to an embodiment may detect the text region from the captured screen using the selected template when the layout of the selected template corresponds to the layout of the captured screen. Here, it is assumed that information about the text region is preset in the template.

Meanwhile, according to another embodiment, the device 100 may detect the text region from the captured screen by comparing the generated template with the captured screen.

In operation S790, the device 100 may recognize text included in the detected text region.

The device 100 according to an embodiment may recognize the text from the detected text region using a text reading technique such as OCR technology. The device 100 may extract text indicating a title and a genre of the content from the recognized text.

The device 100 according to an embodiment may recognize the content displayed on the screen of the device 100 based on the extracted text.
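The control flow of operations S750 to S790 may be summarized in the following sketch. It only illustrates the ordering of the decisions described above: the function `recognize_from_captures` and all of its callable parameters (`is_gateway`, `matches`, `generate`, `read_text`) are hypothetical placeholders for the gateway detection, layout comparison, template generation, and text reading steps, which are supplied by the caller.

```python
from typing import Callable, Optional, Sequence, TypeVar

Screen = TypeVar("Screen")
Template = TypeVar("Template")


def recognize_from_captures(
    screens: Sequence[Screen],
    selected: Template,
    is_gateway: Callable[[Screen], bool],
    matches: Callable[[Screen, Template], bool],
    generate: Callable[[Screen], Template],
    read_text: Callable[[Screen, Template], str],
) -> Optional[str]:
    """Walk the S750 to S790 decisions over screens captured after a control signal."""
    # S750: look for a captured screen that corresponds to a gateway page.
    gateway = next((s for s in screens if is_gateway(s)), None)
    if gateway is None:
        # No gateway page: the caller waits for another control signal.
        return None
    # S760 / S770: keep the selected template if it fits, else generate one.
    template = selected if matches(gateway, selected) else generate(gateway)
    # S780 / S790: detect the text region and read the text with that template.
    return read_text(gateway, template)
```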

FIG. 8 is a flowchart for explaining a method performed by the device 100 of detecting text corresponding to content recognition information using a meaning recognition model from a captured screen according to an embodiment.

In operation S810, the device 100 may select a template to be used for recognizing content displayed on the device 100 when a type of a content service providing the content to the device 100 is determined.

Meanwhile, operation S810 may correspond to operation S210 described above with reference to FIG. 2.

In operation S820, the device 100 may capture a screen of the device 100 and determine whether the captured screen corresponds to a gateway page when a control signal is received.

Meanwhile, operation S820 may correspond to operation S220 described above with reference to FIG. 2.

In operation S830, the device 100 may determine whether the captured screen corresponds to the selected template when the captured screen corresponds to the gateway page.

Meanwhile, operation S830 may correspond to operation S230 described above with reference to FIG. 2.

In operation S840, when the captured screen does not correspond to the selected template, the device 100 may generate a template based on the captured screen.

Meanwhile, operation S840 may correspond to operation S240 described above with reference to FIG. 2.

In operation S850, the device 100 may detect a text region from the captured screen using the generated template.

In operation S860, the device 100 may recognize text included in the detected text region.

In operation S870, the device 100 may detect text corresponding to content recognition information by comparing the recognized text with at least one text included in the meaning recognition model.

The device 100 according to an embodiment may preset the meaning recognition model. The meaning recognition model may include at least one text for recognizing the content. For example, the meaning recognition model may include at least one text indicating a title of currently provided content and a name or number of a channel on which the content is being provided.

Further, according to another example, the meaning recognition model may be set differently according to an ID of the device 100 and an ID of a user. For example, when the user is a woman in her twenties, the device 100 may select, from among a plurality of meaning recognition models, a meaning recognition model including at least one text indicating a title of content and a channel type preferred by women in their twenties.

The device 100 according to an embodiment may detect text from the extracted character string using a format pattern predetermined with respect to the template screen. For example, when the format pattern for the content recognition information specifies that the title of the content is displayed next to the channel name, the device 100 may determine that the text after the channel name corresponds to the title of the content and detect the text after the channel name.
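The format pattern in which the title follows the channel name may be illustrated as follows. This is a sketch under stated assumptions: the function `title_after_channel`, the list of known channel names, and the separator characters that are stripped are all hypothetical and are not taken from this description.

```python
from typing import Iterable, Optional


def title_after_channel(text: str,
                        channel_names: Iterable[str]) -> Optional[str]:
    """Return the text that follows the first known channel name, if any."""
    # Try longer names first so "Channel A HD" wins over "Channel A".
    for name in sorted(channel_names, key=len, reverse=True):
        index = text.find(name)
        if index != -1:
            # Drop assumed separators (spaces, dashes, colons, bars).
            title = text[index + len(name):].strip(" -:|")
            return title or None
    return None


# Example: the channel name is followed by the title of the content.
print(title_after_channel("11 Channel A | Movie Night", ["Channel A"]))
```

When no known channel name is present, the function returns `None`, which corresponds to the case in which the extracted character string does not follow the predetermined format pattern.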

Meanwhile, according to another example, the at least one text included in the extracted character string may not correspond to a predetermined format pattern with respect to the template. In this case, to detect the text indicating the content recognition information, the device 100 may use a probability model that probabilistically calculates the relationship between texts representing recognition information about a plurality of pieces of content provided by at least one content service, the texts being stored in the meaning recognition model. For example, based on the probability model, the device 100 may extract, from a character string stating that A is a leading actor and B is an exclusive broadcast, text indicating that the name of an actor is A and the name of a broadcast is B.

The device 100 according to another embodiment may read the extracted character string and compare the at least one text included in the extracted character string with at least one text included in a predetermined meaning recognition model. Here, the at least one text included in the meaning recognition model may be the text representing the content recognition information. The device 100 may extract the text corresponding to the predetermined meaning recognition model from the extracted character string by comparing the predetermined meaning recognition model with the extracted character string.
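The comparison with the meaning recognition model may be sketched as a lookup of known texts inside the recognized character string. The representation of the model as a dictionary mapping each known text to the kind of recognition information it represents, and the case-insensitive substring match, are assumptions made for illustration only.

```python
from typing import Dict


def detect_recognition_text(recognized: str,
                            model: Dict[str, str]) -> Dict[str, str]:
    """Map each kind of recognition information to the model text found."""
    found: Dict[str, str] = {}
    haystack = recognized.casefold()
    for text, kind in model.items():
        # Assumed matching rule: case-insensitive substring comparison.
        if text.casefold() in haystack:
            # Keep the first match for each kind (title, channel, ...).
            found.setdefault(kind, text)
    return found


# Hypothetical model: known texts and what each one represents.
model = {"Movie Night": "title", "Channel A": "channel"}
print(detect_recognition_text("channel a 21:00 Movie Night Settings", model))
```

Texts in the character string that match nothing in the model, such as the menu label in the example, are left out of the result.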

In operation S880, the device 100 may update the template based on the detected text.

The device 100 according to an embodiment may not perform reading on texts other than the detected text when the texts other than the detected text are not content recognition information. Thus, for example, the device 100 may recognize a location where the texts other than the detected text are displayed and remove, from the template, a default component corresponding to the recognized location. Also, according to another example, the device 100 may display an index indicating that text regarding the title of the content is detected, with respect to the default component disposed in a location where the text regarding the title of the content is detected, among at least one default component displayed on the template.

FIG. 9 is a diagram for explaining a method performed by the device 100 of updating a generated template according to an embodiment.

Referring to FIG. 9, the device 100 according to an embodiment may generate the template based on a captured screen 910 when the captured screen 910 is determined as a gateway page. In FIG. 9, it is assumed that the template selected by the device 100 does not correspond to the captured screen 910.

The device 100 according to an embodiment may extract at least one component 912, 914, and 916 from the captured screen 910 and generate a template 920 in which default components 922, 924, and 926 are disposed in locations corresponding to the extracted at least one component 912, 914, and 916.

Meanwhile, the device 100 according to an embodiment may update the generated template 920 by comparing the generated template 920 with recognition information of content detected from the detected gateway page 810. The device 100 may read text displayed on the captured screen 910 and select text including the content recognition information from the read text. The device 100 may also select the component 912 corresponding to the selected text and update the template 920 by removing, from among the default components 922, 924, and 926 displayed in the template 920, the default components 924 and 926 other than the default component 922 corresponding to the selected component 912.
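The generation and update of the template described with reference to FIG. 9 may be sketched as follows. The `DefaultComponent` class, the two functions, and the use of bounding boxes to stand for the components 912, 914, and 916 are hypothetical and serve only to illustrate the described behavior.

```python
from dataclasses import dataclass
from typing import List, Tuple

# (x, y, width, height); an assumed representation of a component's location.
Box = Tuple[int, int, int, int]


@dataclass(frozen=True)
class DefaultComponent:
    """Placeholder placed in the template where a screen component was found."""
    box: Box
    holds_recognition_info: bool = False


def generate_template(component_boxes: List[Box]) -> List[DefaultComponent]:
    """Place one default component at the location of each extracted component."""
    return [DefaultComponent(box) for box in component_boxes]


def update_template(template: List[DefaultComponent],
                    selected_box: Box) -> List[DefaultComponent]:
    """Keep only the default component whose location holds the selected text."""
    return [
        DefaultComponent(component.box, holds_recognition_info=True)
        for component in template
        if component.box == selected_box
    ]


# Boxes standing in for components 912, 914, and 916 of the captured screen.
box_912, box_914, box_916 = (0, 400, 300, 40), (320, 400, 200, 40), (540, 400, 80, 40)
template_920 = generate_template([box_912, box_914, box_916])
updated_920 = update_template(template_920, box_912)
```

After the update, only the default component at the location of the component 912 remains, so later captures need to be read only in that region.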

FIG. 10 is a block diagram of the device 100 according to an embodiment.

Referring to FIG. 10, the device 100 according to an embodiment may include a display 110, an input unit 120, and a controller 130. However, not all illustrated components are indispensable components. The device 100 may be implemented by more or fewer components than the illustrated components.

The display 110 according to an embodiment displays content. For example, the display 110 may display the content received from an external content service providing apparatus.

The input unit 120 according to an embodiment receives a control signal for controlling the content.

The controller 130 according to an embodiment determines a type of a content service providing the content to the device 100. Also, the controller 130 selects a template used for recognizing content displayed on the device 100 when the type of the content service is determined. Meanwhile, when the template corresponding to the type of the content service is not stored in the device 100, the controller 130 may select a predetermined basic template as the template corresponding to the content service.

The controller 130 according to an embodiment captures a screen of the device 100 when a control signal is received and determines whether the captured screen corresponds to a gateway page. The controller 130 also determines whether the captured screen corresponds to the selected template when, as a result of the determination, the captured screen corresponds to the gateway page. When the captured screen does not correspond to the selected template, the controller 130 generates the template corresponding to the content service based on the captured screen.

The controller 130 according to an embodiment may capture a screen of the device in a predetermined time unit from a time when the control signal is received. Also, the controller 130 may detect the gateway page by comparing a plurality of screens captured in the predetermined time unit. For example, when a component displayed on a first screen is removed from a second screen as a result of comparing the first screen and the second screen, the controller 130 may detect the first screen as the gateway page.
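The comparison of consecutive captured screens may be sketched as follows, under the assumption that each captured screen has already been reduced to the set of components (here, bounding boxes) displayed on it. The function name and this representation are hypothetical; how components are extracted from a captured image is outside the scope of the sketch.

```python
from typing import FrozenSet, Optional, Sequence, Tuple

# (x, y, width, height); an assumed representation of a displayed component.
Box = Tuple[int, int, int, int]
Screen = FrozenSet[Box]


def detect_gateway_page(screens: Sequence[Screen]) -> Optional[int]:
    """Return the index of the first screen whose component is gone in the next one."""
    for index in range(len(screens) - 1):
        first, second = screens[index], screens[index + 1]
        # A component displayed on the first screen was removed from the second.
        if first - second:
            return index
    return None


# Example: a banner appears after the control signal and then disappears.
banner = (0, 400, 640, 80)
captures = [frozenset(), frozenset({banner}), frozenset({banner}), frozenset()]
gateway_index = detect_gateway_page(captures)
```

In the example, the third capture is reported, since it is the last screen on which the banner is still displayed before being removed.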

The controller 130 according to an embodiment may recognize a layout of components included in the captured screen. The controller 130 may generate a template in which default components corresponding to the components are displayed based on the recognized layout.

According to another embodiment, the controller 130 recognizes a text region on the captured screen. The controller 130 may generate a template including information about a position and arrangement of the recognized text region.

The controller 130 according to an embodiment may detect the text region from the captured screen using the generated template. The controller 130 may recognize the content displayed on the screen of the device 100 based on text detected from the detected text region.

The controller 130 according to an embodiment may detect text determined as content recognition information in the detected text by comparing the detected text with at least one text included in a predetermined meaning recognition model. Also, the controller 130 may update the template based on a position on which the detected text is displayed on the captured screen.

FIG. 11 is a block diagram of a device 1100 according to another embodiment.

Referring to FIG. 11, the device 1100 according to an embodiment may further include an audio processor 1115, an audio output unit 1125, a communicator 1140, a tuner 1150, a power supplier 1160, a sensor 1170, a video processor 1180, and a storage 1190, in addition to a display 1110, an input/output unit 1120, and a controller 1130.

Hereinafter, the components will be described in order.

The display 1110 displays content provided to the device 1100 from a content service. Meanwhile, the display 1110 may correspond to the display 110 described with reference to FIG. 10 above.

The display 1110 converts an image signal, a data signal, an OSD signal, a control signal, etc. processed by the controller 1130 to generate driving signals. The display 1110 may be a PDP, an LCD, an OLED, a flexible display, or a 3D display. Also, the display 1110 may be configured as a touch screen and used as an input device in addition to an output device.

In the present specification, the term display 1110 may be used interchangeably with a screen in that the display 1110 displays the content.

The input/output unit 1120 receives video (e.g., a moving image, etc.), audio (e.g., voice, music, etc.), and additional information (e.g., EPG, etc.) from outside the device 1100 under the control of the controller 1130. The input/output unit 1120 according to an embodiment may receive a control signal for controlling the content displayed on the device 1100. The input/output unit 1120 may include one of a high-definition multimedia interface (HDMI) port 1121, a component jack 1122, a PC port 1123, and a USB port 1124. The input/output unit 1120 may include a combination of the HDMI port 1121, the component jack 1122, the PC port 1123, and the USB port 1124.

It will be readily understood by those of skill in the art that the configuration and operation of the input/output unit 1120 may be variously implemented according to the embodiments.

The controller 1130 usually controls the overall operation of the device 1100. For example, the controller 1130 may generally control the display 1110, the input/output unit 1120, the audio processor 1115, the audio output unit 1125, the communicator 1140, the tuner 1150, the power supplier 1160, the sensor 1170, the video processor 1180, the storage 1190, and the like by executing programs stored in the storage 1190.

Meanwhile, the controller 1130 according to an embodiment may correspond to the controller 130 described with reference to FIG. 10 above.

The communicator 1140 may connect the device 1100 to an external device (for example, an input device, a service providing device, a server, or the like) under the control of the controller 1130. For example, the controller 1130 may transmit/receive content to/from a service providing device connected through the communicator 1140, download an application from the service providing device, or browse the web. The communicator 1140 may include one of a wireless LAN 1141, a Bluetooth 1142, and a wired Ethernet 1143 according to the performance and structure of the device 1100. The communicator 1140 may also include a combination of the wireless LAN 1141, the Bluetooth 1142, and the wired Ethernet 1143. The communicator 1140 may receive the control signal for controlling the content under the control of the controller 1130. The control signal may be implemented as a Bluetooth type control signal, an RF signal type control signal, or a WiFi type control signal. In this case, the communicator 1140 may perform a function corresponding to the function of the input/output unit 1120 described above.

The communicator 1140 may further include, in addition to Bluetooth, another short-range communication unit (for example, a near field communication (NFC) unit, not shown) and a Bluetooth low energy (BLE) unit (not shown).

The communicator 1140 may transmit extracted content recognition information to the server 200. The communicator 1140 may receive, from the server 200, content viewing pattern information of a user of the device 1100 that is determined based on the extracted content recognition information.

The audio processor 1115 performs processing on audio data. The audio processor 1115 may perform various kinds of processing such as decoding, amplification, noise filtering, and the like on the audio data. Meanwhile, the audio processor 1115 may include a plurality of audio processing modules for processing audio corresponding to a plurality of pieces of content.

The audio output unit 1125 outputs audio included in a broadcast signal received through the tuner 1150 under the control of the controller 1130. The audio output unit 1125 may output audio (e.g., voice or sound) input through the communicator 1140 or the input/output unit 1120. The audio output unit 1125 may also output audio stored in the storage 1190 under the control of the controller 1130. The audio output unit 1125 may include at least one of a speaker 1126, a headphone output terminal 1127, or a Sony/Philips Digital Interface (S/PDIF) output terminal 1128. The audio output unit 1125 may include a combination of the speaker 1126, the headphone output terminal 1127, and the S/PDIF output terminal 1128.

The tuner 1150 may receive the broadcast signal in a frequency band corresponding to a channel number (e.g., cable broadcast 506) according to a user input (for example, the control signal, a channel number input, a channel up-down input, or a channel input in an EPG screen).

The tuner 1150 may receive broadcast signals from various sources such as terrestrial broadcast, cable broadcast, satellite broadcast, internet broadcast, and the like. The tuner 1150 may receive a broadcast signal from a source such as an analog broadcast or a digital broadcast. The broadcast signal received through the tuner 1150 is decoded (for example, audio decoding, video decoding, or additional information decoding) and separated into audio, video, and/or additional information. The separated audio, video, and/or additional information may be stored in the storage 1190 under the control of the controller 1130.

The tuner 1150 may tune and select only a frequency of a channel to be received by the device 1100 from among many radio wave components through amplification, mixing, and resonance of a broadcast signal received in a wired or wireless manner. The broadcast signal includes audio, video, and additional information (for example, an electronic program guide (EPG)).

The power supplier 1160 supplies power from an external power source to internal components of the device 1100 under the control of the controller 1130. Also, the power supplier 1160 may supply power to the internal components from one or more batteries (not shown) located inside the device 1100 under the control of the controller 1130.

The sensor 1170 according to an embodiment may sense a user input and transmit a sensed signal to the controller 1130. Also, the sensor 1170 may sense user inputs for power on/off, channel selection, channel-up/down, and screen setup. Also, the sensor 1170 according to an embodiment may sense a user input for moving a cursor displayed on the display 1110 and a direction key input for moving a focus between candidate items. Also, the sensor 1170 senses a user's voice, an image of the user, or an interaction of the user.

A microphone 1171 receives a user's uttered voice. The microphone 1171 may convert the received voice into an electrical signal and output the electrical signal to the controller 1130.

The microphone 1171 may be implemented integrally with or separately from the device 1100. The separated microphone 1171 may be electrically connected to the device 1100 through the communicator 1140 or the input/output unit 1120. It will be readily understood by those of skill in the art that the microphone 1171 may be omitted according to the performance and structure of the device 1100.

A camera 1172 may convert the received image into an electric signal under the control of the controller 1130 and output the electric signal to the controller 1130.

A light receiver 1173 receives an optical signal (including the control signal) from an external input device through an optical window (not shown) of a bezel of the display 1110 or the like. The light receiver 1173 may receive the optical signal corresponding to a user input (e.g., touch, pressing, touch gesture, voice, or motion) from the input device. The control signal may be extracted from the received optical signal under the control of the controller 1130.

The video processor 1180 performs processing on video data received by the device 1100. The video processor 1180 may perform various kinds of image processing such as decoding, scaling, noise filtering, frame rate conversion, resolution conversion, and the like on the video data.

The controller 1130 may include a RAM 1181 that stores a signal or data input from the outside of the device 1100 or is used as a storage area corresponding to various jobs performed by the device 1100, a ROM 1182 in which a control program for controlling the device 1100 is stored, and a processor 1183.

The processor 1183 may include a graphic processing unit (not shown) for graphics processing corresponding to the video. The processor 1183 may be implemented as a SoC (System On Chip) incorporating a core (not shown) and a GPU (not shown). The processor 1183 may include a single core, a dual core, a triple core, a quad core, or a multiple thereof.

The processor 1183 may also include a plurality of processors. For example, the processor 1183 may be implemented as a main processor (not shown) and a sub processor (not shown) operating in a sleep mode.

The graphics processor 1184 generates a screen including various objects such as an icon, an image, and text using an operator (not shown) and a renderer (not shown). The operator calculates attribute values such as a coordinate value, a shape, a size, and a color with which each of the objects is to be displayed according to a layout of the screen, using the user input sensed through the sensor 1170. The renderer generates screens of various layouts including the objects based on the attribute values calculated by the operator. The screens generated by the renderer are displayed within a display area of the display 1110.

First to nth interfaces 1185-1 to 1185-n are connected to the various components described above. One of the first to nth interfaces 1185-1 to 1185-n may be a network interface connected to an external device over a network.

The RAM 1181, the ROM 1182, the processor 1183, the graphics processor 1184, and the first through nth interfaces 1185-1 through 1185-n may be interconnected via an internal bus 1186.

The term controller in the present embodiment includes the processor 1183, the ROM 1182, and the RAM 1181.

The storage 1190 may store various data, programs, or applications for driving and controlling the device 1100 under the control of the controller 1130. For example, the storage 1190 may store a control program for controlling the device 1100 and the controller 1130, an application initially provided from a manufacturer or downloaded from outside, a graphical user interface (GUI) related to the application, objects (e.g., image text, icons, buttons, etc.) for providing the GUI, user information, documents, databases, or related data.

In an embodiment, the term storage includes a memory card (e.g., micro SD card, USB memory, not shown) installed in the storage 1190, the ROM 1182 and the RAM 1181 of the controller 1130, or the device 1100. Also, the storage 1190 may include a nonvolatile memory, a volatile memory, a hard disk drive (HDD), or a solid state drive (SSD).

The storage 1190 may include a broadcast receiving module, a channel control module, a volume control module, a communication control module, a voice recognition module, a motion recognition module, a light receiving module, a display control module, an audio control module, an external input control module, a power control module, a power control module of an external device connected wirelessly (e.g., Bluetooth), a voice database (DB), or a motion database (DB) that is not shown. The modules and database of the storage 1190 that are not shown may be implemented in the form of software to perform a control function of broadcast reception by the device 1100, a channel control function, a volume control function, a communication control function, a voice recognition function, a motion recognition function, a light reception control function, a display control function, an audio control function, an external input control function, a power control function, or a power control function of an external device connected wirelessly (e.g., Bluetooth). The controller 1130 may perform the respective functions using the software stored in the storage 1190.

The method according to an embodiment may be implemented as computer instructions which may be executed by various computer means, and recorded on a non-transitory computer-readable recording medium. The non-transitory computer-readable recording medium may include program instructions, data files, data structures, or a combination thereof. The program instructions recorded on the non-transitory computer-readable recording medium may be specially designed and constructed for the present disclosure or may be known to and usable by one of ordinary skill in a field of computer software. Examples of the non-transitory computer-readable medium include magnetic media such as hard discs, floppy discs, or magnetic tapes, optical media such as compact disc-read only memories (CD-ROMs) or digital versatile discs (DVDs), magneto-optical media such as floptical discs, and hardware devices that are specially configured to store and carry out program instructions, such as ROMs, RAMs, or flash memories. Examples of the program instructions include a high-level programming language that may be executed by a computer using an interpreter as well as a machine language code made by a compiler.

The device described herein may include a processor, a memory for storing program data to be executed by the processor, a permanent storage such as a disk drive, a communications port for handling communications with external devices, and user interface devices, including a display, keys, etc. When software modules are involved, these software modules may be stored as program instructions or computer readable code executable by the processor on a non-transitory computer-readable medium such as read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices. The computer readable recording media may also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. These media may be read by the computer, stored in the memory, and executed by the processor.

For the purposes of promoting an understanding of the principles of thedisclosure, reference has been made to the embodiments illustrated inthe drawings, and specific language has been used to describe theseembodiments. However, no limitation of the scope of the disclosure isintended by this specific language, and the disclosure should beconstrued to encompass all embodiments that would normally occur to oneof ordinary skill in the art.

The disclosure may be described in terms of functional block componentsand various processing steps. Such functional blocks may be realized byany number of hardware and/or software components configured to performthe specified functions. For example, the disclosure may employ variousintegrated circuit components, e.g., memory elements, processingelements, logic elements, look-up tables, and the like, which may carryout a variety of functions under the control of one or moremicroprocessors or other control devices. Similarly, where the elementsof the disclosure are implemented using software programming or softwareelements, the disclosure may be implemented with any programming orscripting language such as C, C++, Java, assembler, or the like, withthe various algorithms being implemented with any combination of datastructures, objects, processes, routines or other programming elements.Functional aspects may be implemented in algorithms that execute on oneor more processors. Furthermore, the disclosure may employ any number oftechniques of the related art for electronics configuration, signalprocessing and/or control, data processing and the like. The words“mechanism” and “element” are used broadly and are not limited tomechanical or physical embodiments, but may include software routines inconjunction with processors, etc.

The particular implementations shown and described herein areillustrative examples of the disclosure and are not intended tootherwise limit the scope of the disclosure in any way. For the sake ofbrevity, conventional electronics, control systems, software developmentand other functional aspects of the systems (and components of theindividual operating components of the systems) may not be described indetail. Furthermore, the connecting lines, or connectors shown in thevarious figures presented are intended to represent exemplary functionalrelationships and/or physical or logical couplings between the variouselements. It should be noted that many alternative or additionalfunctional relationships, physical connections or logical connectionsmay be present in a practical device. Moreover, no item or component isessential to the practice of the disclosure unless the element isspecifically described as “essential” or “critical”. It will berecognized that the terms “comprising,” “including,” and “having,” asused herein, are specifically intended to be read as open-ended terms ofart.

The use of the terms “the” and similar referents in the context of describing the disclosure (especially in the context of the following claims) is to be construed to cover both the singular and the plural. Furthermore, recitation of ranges of values herein is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. Finally, the steps of all methods described herein may be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate the disclosure and does not pose a limitation on the scope of the disclosure unless otherwise claimed. It will also be appreciated by those skilled in the art that various modifications, combinations, and alterations may be made depending on design criteria and factors within the scope of the appended claims or equivalents thereof.

The invention claimed is:
 1. A method, performed by a device, of recognizing content, the method comprising: selecting a template used for recognizing content displayed by the device, based on a type of a content service that provides the content to the device; capturing at least a first screen of displayed content and a second screen of displayed content, based on the device receiving a control signal; comparing the captured first screen to the captured second screen to determine whether the captured first screen corresponds to a gateway page comprising content recognition information; and based on determining that the captured first screen corresponds to the gateway page: determining whether the captured first screen corresponds to the selected template; extracting content recognition information from the captured first screen using the selected template, based on determining that the captured first screen corresponds to the selected template; and generating a template corresponding to the content service, based on a layout of components included in the captured first screen, based on determining that the captured first screen does not correspond to the selected template.
 2. The method of claim 1, wherein the generating of the template comprises: recognizing the layout of the components included in the captured first screen, wherein the generated template comprises default components corresponding to the components, based on the recognized layout.
 3. The method of claim 1, wherein the generating of the template comprises: recognizing a text region included in the captured first screen, wherein the generated template comprises information about a position and arrangement of the recognized text region.
 4. The method of claim 1, further comprising: determining the type of the content service that provides the content to the device, wherein the selecting of the template comprises: based on a template corresponding to the type of the content service not being stored in the device, selecting, as the template used for recognizing content, a predetermined template.
 5. The method of claim 1, further comprising: detecting a text region from the captured first screen by using the generated template; and recognizing displayed content, based on text detected from the detected text region.
 6. The method of claim 5, further comprising: detecting content recognition information from the detected text by comparing the detected text and at least one text included in a meaning recognition model.
 7. The method of claim 6, further comprising: updating the generated template based on a display position of the content recognition information.
 8. A device comprising: a display; an input device; and a processor configured to: select a template used for recognizing content displayed on the display, based on a type of the content service that provides the content to the device; control to capture at least a first screen of displayed content and a second screen of displayed content, based on a control signal corresponding to an input to the input device; compare the captured first screen to the captured second screen to determine whether the captured first screen corresponds to a gateway page comprising content recognition information; and based on determining that the captured first screen corresponds to the gateway page: determine whether the captured first screen corresponds to the selected template; extracting content recognition information from the captured first screen using the selected template, based on determining that the captured first screen corresponds to the selected template; and generate a template corresponding to the content service based on a layout of components included in the captured first screen, based on determining that the captured first screen does not correspond to the selected template.
 9. The device of claim 8, wherein the processor is further configured to recognize the layout of the components included in the captured first screen, wherein the generated template comprises default components corresponding to the components, based on the recognized layout.
 10. The device of claim 8, wherein the processor is further configured to recognize a text region included in the captured first screen, wherein the generated template comprises information about a position and arrangement of the recognized text region.
 11. The device of claim 8, wherein the processor is further configured to: determine the type of the content service that provides the content to the device; and based on a template corresponding to the type of the content service not being stored in the device, select, as the template used for recognizing content, a predetermined template.
 12. The device of claim 8, wherein the processor is further configured to: detect a text region from the captured first screen by using the generated templates; and recognize displayed content, based on text detected from the detected text region.
 13. The device of claim 12, wherein the processor is further configured to detect content recognition information from the detected text by comparing the detected text and at least one text included in a meaning recognition model.
 14. The device of claim 13, wherein the processor is further configured to update the generated template based on a display position of the content recognition information.
 15. A non-transitory computer-readable recording medium on which a program for recognizing content is recorded, the program, when executed by a processor of an electronic device, causing the processor to control the electronic device to perform operations comprising: selecting a template used for recognizing content displayed by the device, based on a type of a content service that provides the content to the device; capturing at least a first screen of displayed content and a second screen of displayed content, based on the device receiving a control signal; comparing the captured first screen to the captured second screen to determine whether the captured first screen corresponds to a gateway page comprising content recognition information; and based on determining that the captured first screen corresponds to the gateway page: determining whether the captured first screen corresponds to the selected template; extracting content recognition information from the captured first screen using the selected template, based on determining that the captured first screen corresponds to the selected template; and generating a template corresponding to the content service, based on a layout of components included in the captured first screen, based on determining that the captured first screen does not correspond to the selected template.
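
For illustration only, the control flow recited in claims 1 and 4 may be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the disclosed implementation and not a limitation on the claims. The names Component, Template, layout_of, is_gateway_page, matches, and recognize are hypothetical, a captured screen is assumed to be already parsed into components with bounding boxes, and the gateway-page test (a text component present in the first screen but absent from the second) is an assumed heuristic standing in for whatever comparison an embodiment uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    """Hypothetical parsed screen component."""
    kind: str          # e.g. "text" or "image"
    box: tuple         # (x, y, width, height)
    text: str = ""


@dataclass(frozen=True)
class Template:
    """Hypothetical template: expected (kind, box) pairs for a service."""
    service_type: str
    layout: tuple


def layout_of(screen):
    """Reduce a screen to its layout, ignoring the text itself."""
    return tuple((c.kind, c.box) for c in screen)


def is_gateway_page(first, second):
    # Assumed heuristic: the first screen shows a text component
    # that the second screen does not show at the same position.
    second_layout = set(layout_of(second))
    return any(c.kind == "text" and (c.kind, c.box) not in second_layout
               for c in first)


def matches(screen, template):
    # Assumed criterion: every template region appears in the screen.
    return set(template.layout) <= set(layout_of(screen))


def recognize(service_type, first, second, templates, default_template):
    """Return (recognition_info or None, template used or generated)."""
    # Fall back to a predetermined template when none is stored.
    template = templates.get(service_type, default_template)
    if not is_gateway_page(first, second):
        return None, template
    if matches(first, template):
        wanted = {box for kind, box in template.layout if kind == "text"}
        info = [c.text for c in first if c.kind == "text" and c.box in wanted]
        return info, template
    # No match: generate a template from the first screen's layout.
    generated = Template(service_type, layout_of(first))
    templates[service_type] = generated
    return None, generated
```

In this sketch a generated template is stored under the service type, so a later call for the same content service selects it instead of the predetermined template.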